Construction of Thai Lexicon from Existing Dictionaries and Texts on the Web

نویسندگان

  • Thatsanee Charoenporn
  • Canasai Kruengkrai
  • Thanaruk Theeramunkong
  • Virach Sornlertlamvanich
چکیده

THING ORGANIZATION PERSON 0.69 0.04 0.03 สังคม ‘society’ รัฐบาล ‘government’ นักทองเท่ียว ‘tourist’ Subject of ซื้อ ‘buy’ PERSON CONSTRUCTION ORGANIZATION 0.40 0.35 0.25 ชาวบาน ‘villager’ โรงพยาบาล ‘hospital’ บริษัท ‘company’ Subject of จาย ‘pay’ PERSON CONSTRUCTION CULTURAL_AB_THING 0.54 0.39 0.08 นักเรียน ‘student’ ธนาคาร ‘bank’ ประธาน ‘chairman’ Table 3 Generalization results with subject-verbrelationship. Class Prob. Word Example Direct Object of ตรวจ ‘check’ ARTIFACT ABSTRACT_THING ANIMAL_PART 0.34 0.22 0.18 รางวัล ‘prize’ เอกสาร ‘document’ รางกาย ‘body’ Direct Object of สราง ‘build’

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On multiword lexical units and their role in maritime dictionaries

Multi-word lexical units are a typical feature of specialized dictionaries, in particular monolingual and bilingual maritime dictionaries. The paper studies the concept of the multi-word lexical unit and considers the similarities and differences of their selection and presentation in monolingual and bilingual maritime dictionaries. The work analyses such issues as the classification of multi-w...

متن کامل

The Effect of Lexicon-based Debates on the Felicity of Lexical Equivalents in Translating Literary Texts by Iranian EFL Learners

This study was an attempt to investigate the effect of lexicon-based debates on the felicity of lexical equivalents in translating literary texts by Iranian EFL learners.  To fulfill the purpose of this study, 59 university students, majoring in English Translation, were randomly assigned to the experimental and control groups from a total of 73 students based on their performance on a mock TOE...

متن کامل

Learning Dictionaries for Information Extraction by Multi-Level Bootstrapping

Information extraction systems usually require two dictionaries: a semantic lexicon and a dictionary of extraction patterns for the domain. We present a multilevel bootstrapping algorithm that generates both the semantic lexicon and extraction patterns simultaneously. As input, our technique requires only unannotated training texts and a handful of seed words for a category. We use a mutual boo...

متن کامل

Presenting a method for extracting structured domain-dependent information from Farsi Web pages

Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...

متن کامل

A Supervised Method for Constructing Sentiment Lexicon in Persian Language

Due to the increasing growth of digital content on the internet and social media, sentiment analysis problem is one of the emerging fields. This problem deals with information extraction and knowledge discovery from textual data using natural language processing has attracted the attention of many researchers. Construction of sentiment lexicon as a valuable language resource is a one of the imp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEICE Transactions

دوره 89-D  شماره 

صفحات  -

تاریخ انتشار 2006